Supplementary Material: Hardware Resilience Properties of Text-Guided Image Classifiers
This section contains supplementary material that provides additional details for the main paper.
Note that for the error-injection experiments, we perform single-bit flips only in the convolutional and linear layers of the neural network, in line with other work in this field. In this section, we provide visualizations of additional backbones. Figures 9 and 10 extend Figure 3 to more networks: the Y-axis shows the absolute value of the maximum neuron value observed in each layer, plotted along the X-axis. Next, Figures 11 and 12 extend Figure 4, showing the impact of our proposed technique on end-to-end network accuracy.
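As a minimal illustration of this fault model (a sketch only, not the paper's actual injection harness), a single-bit flip in a float32 weight can be simulated by toggling one bit of its IEEE-754 encoding:

```python
import struct

def flip_bit(value: float, bit: int) -> float:
    """Flip a single bit in the IEEE-754 float32 encoding of `value`.

    Illustrative sketch of the single-bit-flip fault model; real
    experiments would apply this to weights of the convolutional and
    linear layers of a trained network.
    """
    assert 0 <= bit < 32
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    bits ^= 1 << bit  # toggle the chosen bit
    (corrupted,) = struct.unpack("<f", struct.pack("<I", bits))
    return corrupted

# Flipping the sign bit (bit 31) negates the value:
print(flip_bit(1.0, 31))   # -1.0
# Flipping a high exponent bit (bit 30 of 1.0) yields +inf, which is why
# bit flips in high-magnitude positions are the main resilience concern:
print(flip_bit(1.0, 30))   # inf
```

Since XOR is its own inverse, flipping the same bit twice recovers the original value, which makes injected faults easy to undo during sweeps.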
Scaling Diffusion Transformers Efficiently via $μ$P
Chenyu Zheng, Xinyu Zhang, Rongzhen Wang, Wei Huang, Zhi Tian, Weilin Huang, Jun Zhu, Chongxuan Li
Diffusion Transformers have emerged as the foundation for vision generative models, but their scalability is limited by the high cost of hyperparameter (HP) tuning at large scales. Recently, Maximal Update Parametrization ($μ$P) was proposed for vanilla Transformers; it enables stable HP transfer from small to large language models and dramatically reduces tuning costs. However, it remains unclear whether $μ$P of vanilla Transformers extends to diffusion Transformers, which differ in both architecture and training objective. In this work, we generalize standard $μ$P to diffusion Transformers and validate its effectiveness through large-scale experiments. First, we rigorously prove that $μ$P of mainstream diffusion Transformers, including U-ViT, DiT, PixArt-$α$, and MMDiT, aligns with that of the vanilla Transformer, enabling the direct application of existing $μ$P methodologies. Leveraging this result, we systematically demonstrate that DiT-$μ$P enjoys robust HP transferability. Notably, DiT-XL-2-$μ$P with a transferred learning rate achieves 2.9 times faster convergence than the original DiT-XL-2. Finally, we validate the effectiveness of $μ$P on text-to-image generation by scaling PixArt-$α$ from 0.04B to 0.61B parameters and MMDiT from 0.18B to 18B. In both cases, models under $μ$P outperform their respective baselines while requiring only a small tuning cost: 5.5% of one training run for PixArt-$α$ and 3% of the consumption by human experts for MMDiT-18B. These results establish $μ$P as a principled and efficient framework for scaling diffusion Transformers.
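The HP transfer the abstract describes can be sketched with the standard $μ$P scaling rules for Adam (from the vanilla-Transformer $μ$P literature): tune the learning rate on a narrow proxy model, then scale the learning rate of matrix-like (hidden) weights inversely with the width multiplier while leaving vector-like parameters unchanged. The function below is a hypothetical illustration of that rule, not code from the paper:

```python
def mup_transfer(base_lr: float, base_width: int, target_width: int) -> dict:
    """Transfer a learning rate tuned on a narrow proxy model to a wider
    model, following standard muP scaling rules for Adam.

    Hypothetical helper for illustration; the exact names and training
    setup here are not taken from the paper.
    """
    mult = target_width / base_width  # width multiplier m
    return {
        # matrix-like (hidden) weights: learning rate scales as 1/m under Adam
        "hidden_lr": base_lr / mult,
        # vector-like params (biases, LayerNorm gains): learning rate unchanged
        "vector_lr": base_lr,
        # width multiplier, also used to scale the output-layer multiplier by 1/m
        "width_mult": mult,
    }

# Example: tune at width 128 on a small proxy, deploy at width 1152 (DiT-XL):
hp = mup_transfer(3e-4, 128, 1152)
```

Because the optimum of the transferred HP is (approximately) width-independent under $μ$P, a single small sweep on the proxy suffices, which is where the reported tuning-cost savings come from.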